per 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBL ISHED UNDER THE PATENT COOPERATION TREATY (PCT) 
(51) International Patent Classification 5 : 



C07K 13/00, C12N 15/31, 15/62 ; 
A61K 37/02, C07K 3/18 



Al 



(11) International Publication Number; WO 93/22342 

(43) International Publication Date: 1 1 November 1993 (1 1.1 1.93) 



(21) International Application Number: PCT/SE93/00375 

(22) International Filing Date: 28 April 1993 (28.04.93) 



(30) Priority data: 
9201331-7 



28 April 1992 (28.04.92) 



SE 



(71) Applicant (for all designated States except US): HIGHTECH 

RECEPTOR AB [SE/SE]; c/o Active, Skeppsbron 2, S- 
211 20 Malm6 (SE). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only) : BJORCK, Lars [SE/ 
SE]; Kornvagen 40, S-240 17 S6dra Sandby (SE). SJO- 
BRING, Ulf [SE/SE]; Lilla Sigridsgatan 1, S-223 50 
Lund (SE). 

(74)Agents: BERG, S., A. et al.; H. Albihns Patentbyra AB, 
" Box 3137, S-103 62 Stockholm (SE). . 



(81) Designated States: JP, US, European patent (AT, BE, CH, 
DE, DK, ES, FR, GB, GR, IE, IT, LU, MC, NL, PT, 
SE). 



Published 

With international search report. 



(54) Title: PROTEIN L AND HYBRID PROTEINS THEREOF 



(57) Abstract 



The invention relates to sequences of protein L which bind to light chains of immunoglobulins. The invention also relates 
to hybrid proteins thereof which are able to bind to both light and heavy chains of immunoglobulin G, in particular protein LG. 
The invention also relates to DNA-sequences which code for the proteins, vectors which include such DNA-sequences, host cells 
which have been transformed with the vectors, methods for producing the proteins, reagent appliances for separation and ident- 
ification of immunoglobulins, compositions and pharmaceutical compositions which contain the proteins. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States par.y to the PCX on the front pages of pamphlets publishing international 

applications under the PCF. 



AT 


Austria 


. FR 


AU 


Australia. 


GA 


BB 


Barbados 


GB 


BE 


Belgium 


CN 


BF 


Burkina Fasu 


GR 


BC 


Bulgaria 


HU 


BJ 


Benin 


IE 


BR 


Bra/tl 


IT 


CA 


Canada 


JP 


CF 


Central African Republic 


KP 


CC 


Congo 


KR 


CH 


Switzerland 


a 


Cote d'l voire 


K'l 


CM 


Cameroon 


I.I 


CS 


Czechoslovakia 


LK 


CZ 


CVech Republic 


I.U 


DE 


Germany 


MC 


DK 


Denmark 


MC 


ES 


Spain 


Ml. 


Fl 


-Finland 


MN 



France 
Gabon 

United Kingdom 

Guinea 

Greece 

Hungary 

Ireland 

Italy 

Japan 

Democratic Peopled Republic 
of Korea 
. Republic of Korea 
Kazakhstan 
Liechtenstein 
Sri 1 anka 
Luxembourg 
Monaco 
Madagascar 
Mali 

Mongolia 



MR 


Mauritania 


MW 


Malawi 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


- Poland 


PT 


Portugal 


RO 


Rumania 


RU 


Russian Federation 


SO 


Sudan 


SE ' 


Sweden 


SK 


Slovak Republic 


SN 


Senegal 


su 


Soviet Union 


TD 


Chad 


TC 


"logo 


UA 


Ukraine 


US 


United States of America 


VN 


Viet Nam 



WO 93/22342 PCT/SE93/00375 

1. 

Protein L and Hybrid Proteins Thereof 

The present invention relates to sequences of protein L 
which bind to light chains of immunoglobulins. The 

5 invention also relates to hybrid proteins of protein L 

having the ability to. bind to light chains of all Ig and 
also to bind to light and heavy chains of immunoglobulin 
G, DNA-sequences which code for the proteins vectors 
that contain such DNA-sequences, host cells transformed 

10 by the vectors, methods for preparing the proteins, 

reagent apparatus for separating and identifying immuno- 
globulins, compositions and pharmaceutical compositions 
which contain the proteins. 

15 The invention relates in particular to the DNA-sequence 
and to the amino acid sequence of the light-chain form- 
ing domains of protein L. 

Proteins which bind to the constant domains (of high 
20 affinity) of the immunoglobulins (Ig) are known. Thus, 

protein A (from Hi-*nhvlococcus aureus ) (Forsgren, A. and 
SjSquist, J. (1966) Protein A from staphylococcus 
aureus. I. Pseudo-immune reaction with human gamma- 
globulin. J. Immunol. 97: 822-827) binds to IgG from 
25 various mammal species. The binding of protein A to IgG 
is mediated essentially via surfaces in the Fc-fragment 
of the heavy chain of the IgG-molecule, although a 
certain bond is also effected with surfaces in the Fab- 
fragment of the IgG. Protein A lacks the ability of 
30 binding to human IgG3 and neither will it bind to IgG 
from several other animal species, such as important 
laboratory animals, for instance rats and goats, which 
limits the use of protein A. 

35 Protein G (Bj6rck, L. and Kronvall, G. (1984) Purifica- 
tion and some properties of streptococcal protein G, a 
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novel IgG-binding reagent, J. Immunol. 133: 969-974; 
Reis, K., Ayoub, E- and Boyl , M. (1984) Streptococcal 
Fc receptors. I. Isolation and partial characterization 
of the receptor from a group C streptococcus. J. 
5 Immunol. 132: 3091-3097) binds to heavy chains in human 
IgG and to all four of its subclasses and also to IgG 
from most mammals , including rats and goats. 

Protein H (Akesson, P., Cooney, J. , Kishimoto, F. and 
10 Bjorck, L. (1990) Protein H - a novel IgG. binding bacte- 
rial protein. Molec. Immun. 27: 523-531) binds to the 
Fc-fragment in IgG from human beings, monkeys and rab- 
bits. However , the bond is weaker than in the case of 
protein G and A f which may be beneficial when wishing to 
15 break the bond with a weak agent, for instance when 

purifying proteins which are readily denatured with the 
aid of antibodies. 

Protein M (Applicant's Patent Application PCT/SE 
20 91100447) binds to the Fc-fragment in IgG from humans, 
monkeys, rabbits, goats, mice and pigs. 

Protein L (Bjorck, L, (1988) Protein L, a novel bacteri- 
al cell wall protein with affinity to Ig L chains. , J. 
25 Immunol. 140: 1194-1197), which binds to the light 

chains in immunoglobulins from all of the classes G, A, 
M, D and E is known (USP 4,876,194). The amino acid se- 
quence and the binding domains of this protein, however, 
have hitherto been unknown. 

30 

The aforesaid proteins can be used in the analysis, 
purification and preparation of antibodies and for 
diagnostic and biological research. 

35 The elimination of immunoglobulins, with the aid. of 



SUBSTITUTE SHEET 



WO 93/22342 



PCT/SE93/0037S 



plasmapheresis, can have a favourable effect on some 
autoimmune diseases. A broadly binding protein would be 
an advantage when wishing to eliminate all classes of 
antibodies in this context. 

5 

It has long been known that infectious conditions can be 
prevented or cured with the introduction of an immune 
serum, i.e. a serum which is rich in antibodies against 
the organism concerned or its potentially harmful pro- 

10 duct. Examples hereof are epidemic jaundice, tetanus, 

diphtheria, rabies and generalized shingles. Antibodies 
against a toxic product may also be effective in the 
case of non- infectious occasioned conditions. Serum 
produced in animals against different snake venoms is 

15 the most common application in this respect. However, 
the administration of sera or antibody preparations is 
not totally without risk. Serious immunological reac- 
tions can occur in some cases. Singular cases of the 
transmission of contagious diseases, such as HIV and 

20 hepatitis through the agency of these products have also 
been described. In order to avoid these secondary ef- 
fects, it has been desirable to produce therapeutic 
antibodies in test tubes. A large number of novel tech- 
niques for the preparation of antibodies in test tubes 

25 have been proposed in recent years. Examples of such 

techniques are hybridom techniques, synthesis of chima- 
antibodies and the preparation of antibodies in bacte- 
ria. These techniques also enable antibodies to be 
specially designed which can further widen the use of 

30 such molecules as therapeutics, for instance in the case 
of certain tumour-diseases . In the case of some of these 
novel methods, however, the product totally lacks the 
Fc-fragment to which all of the described IgG-binding 
proteins, with the exception of protein L, bind. There 

35 is consequently a heed of a process for purifying anti- 
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bodies for therapeutic use, wherein proteins which have 
a broad binding activity/specificity, can be of value. 

It has long been possible to utilize the antibody reac- 
5 tion with its high grade specificity for diagnosing past 
or, in some cases, ongoing infections with different 
parasites ... This indirect method of indicating infectious 
agents is called serology and, in many cases , may be the 
only diagnostic alternative. In certain cases, it can 

10 also be of interest to exhibit specific IgE- or IgA- 
antibodies. When diagnosing with the aid of serology, 
the antigen is most often fastened to a solid phase, 
whereafter serum taken from the patient is incubated 
with the antigen. Antibodies that have been bound from 

15 the patient can then be detected in different ways, 
often with the aid of a secondary antibody (for in- 
stance, an antibody which is directed against the light 
chains of human antibodies) to which an identifiable 
label has been attached, such as alkaline phosphatase, 

20 biotin, radioactive isotopes, fluorescein, etc. In this 
context, a protein having a broad Ig binding capacity 
can be used as an alternative to secondary antibodies. 

There are a number of non-therapeutic and non-diagnostic 
25 reasons for the necessity to bind antibodies. Antibodies 
are often used in research r both for detection and for 
purifying the antigen against which they are directed. 
All techniques which facilitate the purification of 
antibodies and, in particular, techniques which enable 
30 different classes to be purified, are of interest in 
this context. 

Consequently, there is a serious need of a protein which 
has a broad binding activity/ specificity and which binds 
35 to several different classes of immunoglobulins from 

different animal species. At present, there is no known 
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protein which will bind to all immunoglobulin classes. 
The earlier known proteins A f G, H and M bind only to 
heavy chains in IgG. The known protein L (BjSrck et al, 
1988) binds to the light x-chains and 7-chains in im- 
5 munoglobulins of all classes, although the bonds are 

much weaker on the /c-chains. Applicant has charted pro- 
tein L, has determined the amino acid sequence for 
protein L, has identified the light-chain binding do- 
mains on protein L, and has used these to produce hybrid 

10 proteins which possess the IgG^Fc-binding domains of 

protein G. The Applicant is able to show through protein 
LG that a protein of broader binding activity/ 
specificity can be produced thereby. The aforesaid 
proteins A, G, H and M bind to the same surfaces, or to 

15 very closely lying surfaces on IgG-Fc. The protein L 
which binds to light chains can thus be combined with 
any other functionally similar protein which binds to 
the Fc-fragment of heavy chains. A similar broadening of 
the Ig-binding activity is achieved with all alterna- 

20 tives. 

Thus, the present invention relates tb the sequence of 
protein L which binds to light chains in Ig and has the 
amino acid sequence disclosed in Figure 1, and variants, 
25 subf ragments , multiples or mixtures of the domains B1-B5 
having the same binding properties. The invention also 
relates to a DNA-sequence which codes for such protein 
sequences, for instance the DNA-sequence in Figure 1. 

30 The invention is concerned with a hybrid protein which 

is characterized by comprising domains which bind to the 
light x-chains and X-chains in immunoglobulins of all 
classes, and also comprises domains which bind to heavy 
chains in immunoglobulin G, wherein those domains which 

35 bind to the light chains are chosen from among the B1-, 
B2-, B3-, B4- and B5-domains in protein L (see Claim 1) 
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and those domains which bind to heavy chains of immuno- 
globulins are chosen from th C1-, C2- and C3 -domains in 
protein G; the A-, B- and Cl-domains from protein H; the 
A-, B1-, B2- and S-domains in protein Ml or the £-, D-, 
5 A-, B- and C-domains in protein A (see Figure 6) and 

variants, subf ragments , multiples or mixtures of these 
domains that have the same binding properties which bind 
to heavy chains of immunoglobulins. 

10 By subfragment is meant a part-fragment of the given 

domains or fragments which include parts from the vari- 
ous domains having mutually the same binding properties. 
* By variants is meant proteins or peptides in which the 
original amino acid sequence has been modified or 

15 changed by insertion, addition, substitution, inversion 
or exclusion of one or more amino acids, although while 
retaining or improving the binding properties. The 
invention also relates to those proteins which contain 
several arrays (multiples) of the binding domains or 

20 mixtures of the binding domains with retained binding 
properties. The invention also relates to mixtures of 
the various domains of amino acid sequences having 
mutually the same binding properties. 

25 The invention relates in particular to a hybrid protein 
designated LG, and is characterized in that the hybrid 
protein includes the B-domains in protein L which bind 
to the light chains in immunoglobulins, and the Cl- 
domains and C2-domains in protein G which bind to heavy 

30 chains and have the amino acid sequence disclosed in 

Figure 3. The invention also relates to variants, sub- 
fragments, multiples or mixtures of these domains. 

Protein LG is a hybrid protein having a molecular weight 
35 of about 50 kDa (432 amino acids) and comprising four 

domains, each of which binds to light chains in immuno- 
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globulins, and two IgG-binding domains from protein G. 
The hybrid protein combines a br ad IgG-binding activi- 
ty, deriving from the high-grade binding ability of 
protein G to the Fc-fragment of the heavy chain on IgG 
5 with the ability of the protein L to bind to light 

chains of all classes of immunoglobulins. Thus, protein 

LG binds polyclonal human IgG, IgM, IgA, IgD and Ig'E. 

10 -1 

The affinity for human polyclonal IgG is 2 x 10 M 

All four human immunoglobulin classes are bound. Binding 

10 to human IgG is effected with both the K-and the X- 

chain. Both the Fc-fragment and the Fab-fragment of IgG 
are bound to the hybrid protein. The protein also binds 
human IgA-, IgD-, igE- and igM^antibodies . The bond is 
stronger to human immunoglobulins which carry x than to 

15 those which carry the X- isotope of light chains. IgG 

from most mammals will be bound by protein LG, thus also 
IgG from goats and cows, which do not bind to protein L. 
However, rabbit-IgG which binds relatively weakly to 
protein L will bind well to the fusion protein. IgM and 

20 IgA-antibodies from mice, rats and rabbits will be bound 
to the protein. 

Protein LG is highly soluble. It is able to withstand 
heat and will retain its binding properties even at high 

25 temperatures. The binding properties also remain in a 

broad pH-range of 3-10. The protein withstands detergent 
and binds marked or labelled proteins subsequent to 
separation in SDS-PAGE and transference to membranes 
with elektroblotting. The protein can be immobilized on 

30 a solid phase (nitrocellulose, Immobilon®, polyacryl- 
amide, plastic, metal and paper) without losing its 
binding capacity. The binding properties are not influ- 
enced by marking with radioactive substances, biotin or 
alkaline phosphatase. (The binding abilities of the 

35 protein LG are disclosed in Example 3). 
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The protein comprises 432 amino acids and has a moleeu-^ 
lar weight of 50 kDa deriving therefrom. The sequence is 
constructed of an ala sequence of the three last amino 
acids in the A-domain of the protein L (val-glu-asn) , 
5 this ala sequence being unrelated to the two proteins, 
whereafter the four mutually high-grade homologous B- 
domains from protein. L follow. The first of the B-do- 
mains is comprised of 76 amino acids, and the remaining 
domains are each comprised of 72 amino acids. The first 

10 nine amino acids from the fifth B-domain are included 
and followed by two non-related amino acids (pro-met). 
The protein G-sequences then follow. The last amino acid 
in the so-called S-domain from protein G is followed by 
an IgG-binding domain from protein G (CI; 55 amino 

15 acids) , the intermediate D-region (15 amino acids) and r 
the second IgG-binding C-domain (C2; 55 amino 
acids). The last amino acid is a methionine, which 
occurs in natural protein G as the first amino acid in 
the so-called W-region. 

20 

The invention also relates to DNA-sequences which code 
for the aforesaid proteins. 

The gene which codes for the IgG-binding amino acid 
25 sequences can be isolated from the chromosomal DNA from 
Staphylococcus aureus based on the information on the 
DNA-sequence for protein A (S. Lofdahl, B. Guss, M. 
Uhlen, L. Philipsson and M. Lindberg. 1983. Gene for 
staphylococcal protein A. Proc. Natl. Acad. Sci. USA. 
30 80: 697-701) and Figure 6, or from G-streptococcus, 

preferably strain G 148 or C-str eptococcus r preferably 
strain Streptococcus equisimilis C 40, based on the 
information on protein G (B. Guss, M. Eliasson, 
A. Olsson, M. Uhlen, A.-K. Frej, H. JSrvall, I. Flock 
35 and M. Lindberg. 1986. Structure of the IgG-binding 
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regions of streptococcal protein G. EMBO. J. 5: 1567- 
1575) and Figure 6, or from group A-streptococcus , 
e.g. S . pyogenes (type Ml) based on the information on 
the DNA-sequence for protein H (H. Gomi, T. Hozumi, 
5 S. Hattori, C. Tagawa, F. Kishimoto and L. Bjorck. 1990. 
The gene sequence and some properties of protein H - a 
novel IgG binding protein J. Immunol. 144: 4046-4052) 
and Figure 6, or from the chromosomal DNA in group A- 
streptococcus type Ml based on the information on the 

10 DNA-sequence for protein M (Applicant's Patent Applica- 
tion, PCT/SE 91100447) and Figures 6 and 7. The gene 
which codes for the protein that binds to light chains 
can be isolated from the chromosomal DNA from Pepto-co- 
ccus macmus 312 based on the information on the DNA- 

15 sequence for protein L in Claim 2. 

By using the chromosomal DNA't obtained from the afore- 
said bacteria as a template, a DNA-fragment defined with 
the aid of two synthetic oligonucleotides can then be 

20 specifically amplified with the aid of PCR (Polymerase 
Chain Reaction) • This method also enables recognition 
sites to be incorporated for restriction enzymes in the 
ends of the amplified fragments (PCR technology, Ed: PCR 
Technology. Principles and Applications for DNA Amplifi- 

25 cation. Ed. Henry Erlich. Stockton Press, New York, 

1989) . The choice of recognition sequences can be adapt- 
ed in accordance with the vector chosen to express the 
fragment or the DNA-fragment or other DNA-fragments with 
which the amplified fragment is intended to be combined. 

30 The amplified fragment is then cleaved with the restric- 
tion enzyme or enzymes concerned and is combined with 
the fragment /the other fragments concerned and the 
fragments are then cloned together in the chosen vector 
(in this case, the expression vector) (Sambrook, J.E. 

35 Fritsch and T. Maniatis, 1989, Molecular cloning: A 

laboratory manual, 2nd Ed. Cold Spring Harbor Labor ato- 



SUBSTITUTE SHEET 



WO 93/22342 



PCT/SE93/00375 



10 

ries, Cold Spring Harbor, New York, USA). The plasmid 
* vector pHD3 13 can be used (Dalboge, H,E. Bech Jensen, 
H. TSttrup, A. Grubb, M. Abrahamson, I. Olafsson and S. 
Carlsen, 1989. High-level expression of active human 
5 cystatin C in Escherichia coli. Gene, 79: 325-332), 

alternatively one of the vectors in the so-called PET- 
series (PET 20, 21, 22, 23) retailed by Novagen (Madi- 
son, Wisconsin, USA) . - ,~ ^ . 

10 The hybrid proteins are then incorporated in an appro- 
priate host, preferably E. coli. The invention also 
relates to such hosts as those in which the hybrid 
proteins are incorporated. 

15 Those clones which produce the desired proteins can be 
selected from the resultant transf ormants with the aid 
of a known method (Fahnestbck et al. , J. Bacterid. 167, 
870 (1986). 

20 When the proteins that can bind to the light chains in 
the immunoglobulins and to the heavy chains in IgG have 
been purified from the resultant positive clones with 
the aid of conventional methods, the binding specifici- 
ties of the proteins are determined for selection of 

25 those clones which produce a protein that will bind to 
the light chains in immunoglobulins and to the heavy 
chains in IgG. 

Subsequent to having isolated plasmid DNA't in said 
30 clone with conventional methods, the DNA-sequence in the 
inserted material is determined with known methods 
(Sanger et al. , Proc. Natl. Acad. Sci. USA 74, 5463 
(1977) . 

35 The invention also relates to DNA-sequences which hy- 
bridize with said identified DNA-sequences under conven- 
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tional conditions and which code for a protein that 
possesses the desired binding properties. Strict hybrid- 
izing conditions are preferred. 

5 Expression of the genes can be effected with expression 
vectors which have the requisite expression control 
regions, the structural gene being introduced after said 
regions. As illustrated in Figure 1 and Claim 2, the 
structural gene can be used for protein LG or other 
10 hybrid proteins with protein L. 

With regard to expression vectors, different hbst-vec- 
tor-systems have been developed, of which the most 
suitable host-vector-systems can be selected for expres- 
15 sion of the genes according to the present invention. 

The present invention also relates to a method of pro- 
ducing the inventive hybrid proteins by cultivating a 
host cell which is transformed with an expression vector 
20 in which DNA't which codes for the proteins according to 
the invention is inserted. 

This method includes the steps of 

25 (1) inserting into a vector a DNA-fragment which codes 
for the hybrid proteins; 

(2) transforming the resultant vector into an appropri- 
ate host cell; 

30 

(3) cultivating the resultant, transformed cell for 
preparation of the desired hybrid protein; and 

(4) extracting the protein from the culture. 
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In the first step, the DNA- fragment which codes for th 
hybrid protein is inserted in a vector which is suitable 
for the host that is to be used to express the hybrid 
protein. The gene can be inserted by cleaving the vector 
5 with an appropriate restriction enzyme, and then legat- 
ing the gene with the vector. 

In the second step, the vector with the hybrid plasmid 
is inserted into host cells. The host cells may be 
10 Escherichia . coli , Baci 1 lus subtilis or Sacpharomyces 
cerevisiae or other suitable cells. Transformation of 
the expressions hybrid vector into the host cell can be 
effected in a conventional manner and clones which have 
been transformed can then be selected. 

15 

in the third step, the obtained transf ormants are culti- 
vated in an appropriate medium for preparation of the 
desired proteins by expression of the gene coded for the 
hybrid protein. 

20 

In the fourth step, the desired protein is extracted 
from the culture and then purified. This can be achieved 
with the aid of known methods. For instance , the cells 
can be lysed with the aid of known methods , by treating 

25 the cells with ultrasonic sound, enzymes or by mechani- 
cal degradation. The protein which is released from the 
cells or which excretes in the medium can be recovered 
and purified with the aid of conventional methods often 
applied within the biochemical field, such as ion-ex- 

30 change chromatography, gel filtration, affinity chroma- 
tography with the use of immunoglobulins as ligands, 
hydrophobic chromatography or reverse-phase chromato- 
graphy. These methods can be applied individually or in 
suitable combinations. ^ 
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As before mentioned, the inventive proteins may be used 
for binding, identifying or purifying immunoglobulins. 
They can also be bound to pharmaceuticals and used in 
formulations which have delayed release properties. To 
5 this end, the protein may be present in a reagent appli- 
ance for pharmaceutical composition in combination with 
appropriate reagents, additives or carriers. 

The proteins can be handled in a freeze-dried state or 
10, in a PBS-solution (phosphate-buffered physiological salt 
solution) pH 7.2 with 0.02% NaN^. It can also be used 
connected to a solid phase, such as carbohydrate-based 
phases, for instance CNBr-activated sepharose, agarose, 
plastic sur faces, polyacrylamide, nylon, paper, magnetic 
15 spheres, filter, films. The proteins may be marked with 
biotin, alkaline phosphatase, radioactive isotopes, 
fluorescein and other fluorescent substances, gold 
particles, ferritin, and substances which enable lumi- 
nescence to be measured. 

20 

Other proteins may also be used as carriers. These 
carriers may be bound to or incorporated in the pro- 
teins, in accordance with the invention. For instance, 
it is conceivable to consider the whole of proteins A, 
25 G, H, M as carriers for inserted sequences of protein L 
which bind to light chains. In turn, these carriers can 
be bound to the aforesaid carriers. 

The pharmaceutical additions that can be used are those 
30 which are normally used within this field, such as 

pharmaceutical qualities of mannitol, lactose, starch, 
magnesium stearate, sodium saccharate, talcum, cellu- 
lose, glycose, gelatine, saccharose, magnesium carbonate 
and similar extenders, such as lactose, dicalcium phos- 
35 phate and the like; bursting substances, such as starch 
or derivatives thereof; lubricants such as magnesium 



SUBSTITUTE SHEET 



WO 93/22342 PCT/SE93/00375 



5 



10 



15 



25 



14 



stearate and the like; binders, such as starch, gum 
aribicum, polyvinylpyrrolidone, gelatine, cellulose and 
derivatives thereof, and the like. 

The invention will now be described in more detail with 
reference to the accompany drawings, in which 

Figure l illustrates the plasmid P HD389; the ribosomal 
binding sequence, the sequence for the signal peptide 
from ompA and recognition sequence for several restric- 
tion enzymes are shown; 

Figure 2 illustrates the amino acid and nucleic acid 
sequence for protein LG; 



Figure 3 is a schematic overall view of the production 
of protein L; 

Figure 4 is a schematic overall view of the production 
20 of protein LG; 

Figures 5a, 5b and 5c are schematic overall views of the 
production of the hybrid proteins LA, LM and LH respec- 
tively; 



Figure 6 is a schematic inclusive illustration of pro- 
tein A, G, H and Ml. IgGFc-binding domains are for 
protein A: E, D, A, B and C;. for protein G: CI, C2 and 
C3; for protein H: A and/or B; and for protein Ml: A, 
30 Bl, B2, B3 and S; 

Figure 7 illustrates the amino acid and nucleic acid se- 
quence for protein Ml; 
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Figure 8 illustrates Western Blot for protein G, L and 
LG with certain immunoglobulins and immunoglubulin 
f r agments ; and 

5 Figure 9 illustrates Slot-Blot for protein L, G and LG 
with IgG, Igx and Ig Fc. 

The amino acid and nucleic acid sequence of the light- 
chain binding domains of protein L is illustrated in 
10 Claims 1 and 2 respectively. 

It will be observed that the drawings are not to scale. 

Cloning and expression of the IgG-light-chain-binding 
domains in Protein L — . — ^_ 

Construction of synthetic oligonucleotides (primers) for 
20 amplifying sequences coded for protein L, domain B1-B4 

It has been found that a protein L peptide (expressed in 
E. coli ) constructed of the sequence ala-val-glu-asn- 
domain Bi (from protein L) binds to the light chains of 

25 the immunoglobulins (W. Kastern, U. Sjbbring and L. 

Bjfirck. 1992. Structure of peptostreptococcal protein L 
and identification of a repeated immunoglobulin light 
chain-binding domain. J. Biol. Chem. in-print) . Since 
this simple protein L-domain has a relatively low aff in- 

30 ity to Ig, (1 x 10 ? m" 1 ) , and since the naturally occur- 
ring protein L which is constructed of several mutually 
similar domains (B1-B5) has a high affinity to Ig (1 x 
10 10 m" 1 ) four of these domains have been expressed 
together in the following way: 

35 
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PL-N and PL-Cl are synthetic olig nucleotides (manufac- 
tured by the Biomolecular Unit at Lund University (Swe- 
den) in accordance with Applicant's instructions) which 
have been used to amplify a clonable gene fragment which 

5 is amplified with PCR (Polymerase Chain Reaction) and 

which codes for four Ig-binding protein L domains (ala- 

V ai-giu-asn-Bi-B2-B3-B4-lys-lys-val-asp-glu-lys-pro-glu- 

' glu) . Amino acids in the protein L-sequence are given 
for the primer which corresponds to the coded strand 
10 (PL-N) : 

• PL-N: 5 ' -GCTCAGGCGGCGCCGGTAGAAAATAAAGAAGAAACACCAGAAAC-3 ' 

valgluasnlysglugluthrproglu 



15 



20 



25 



30 



35 



5'-end of this oligonucleotide is homologous with the 
coded strand in the protein L-gene (emphasized) : those 
codons which code for the last three amino acids in the 
A-domain (val-glu-asn) are followed by the codons for 
the first six amino acids in the first of the Ig-binding 
domains in protein L (Bl) . 

PL-Cl : 5 ' -CAGCAGCAGGAT!TC22SmTTCTTCTGGTTT^ 
CTT-3' 

This oligonucleotide is homologous with the opposing 
non-coding strand in the gene for protein L (the se- 
quence corresponds to the first nine amino acids in 
domain B5) . 

DNA-fragments which have been amplified with the aid of 
PL-N contain the recognition sequence for the restric- 
tion enzyme Hpall (emphasized) immediately before the 
codon which is considered to code for the first amino 
acid (val) in the expressed protein L-fragment. The 
fragment which is cleav d with Hpall can be ligated with 
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DNA (in this case, consisting of the used expression 
vector pHD389) which has been cleaved with the restric- 
tion enzyme Narl. The DNA-fragment that has been cleaved 
with Hpall and ligated with vector pHD389, which has 
5 been cleaved with Narl, will be translated in the cor- 
rect reading frame. The construction results in trans- 
lation of an additional amino acid (ala) immediately in 
front of the first amino acid in protein L. 

10 DNA-fragments which have been amplified with the aid of 
PL-C1 will contain the recognition sequence for the 
restriction enzyme BamHI (over lined above the sequence) 
immediately after the sequence which codes for the last 
amino acid in the expressed protein L-f ragment (glu) - 

15 The vector pHD389 contains a unique recognition sequence 
for BamHI as part of its so-called multiple cloning 
sequence which follows the Karl recognition sequence. 
DNA-fragments which have been amplified with the aid of 
PL-C1 will include two so-called stop-codons (empha- 

20 sized) which results in translation of the fragment 
inserted in the vector to cease. 

The sequence which was considered to be amplified con- 
tains no internal recognition sequences for the restric- 
25 tion enzymes Hpall or BamHI. 

Amplifying and cloning procedures 

(PCR) (Polymerase Chain Reaction) was effected with a 
30 protocol described by Saiki, R.D. Gelfand, S. Stoffel, 

S. Scharf , R. Higuchi, G. Horn, K. Mullis and H. Erlich, 
1988; Primer-directed enzymatic amplification of DNA 
with a thermostable DNA polymerase. Science 239: 487- 
49127; PCR was effected in a Hybaid Intelligent Heating- 
35 block (Teddington, UK) : 100 jil of a reaction mixture 
contained 50 mM KC1, 10 mM Tris-HCl, pH 8.3, 1.5 mM 
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MgCl , 100 fi/ml gelatine, 300 (M with respect to each of 
the deoxynucleotides (dATP, dCTP, dGTP, dTTP) , (Pharma- 
cia), 20 pmol of each of the oligonucleotides PL-N and 
PL-C1, 10 nl of a target (template) DNA-solution con- 
5 taining 0.1 mg/ml of chromosomal DNA f rom Peptoz. 

streptococceus maonus . species 312. The mixture was 
covered with mineral oil (Sigma) and DNA't was denatured 
- by heating to "98 °C for 10- minutes *~2v5 -units of AmpliTaq 
(Perkin Elmer Cetus, Norwalk, CT) were added and PCR was 
10 then carried out with 25 cycles consisting of a denatur- 
ing step at 94 °C for 1 minute, followed by a hybridizing 
step at 56"C for 1 minute, and finally by an extension 
step at 72 »C for 1 minute. Amplified DMA was analyzed by 
electrophoresis in agarose gel. The amplified DNA't was 
15 cleaved with the restriction enzymes Hpall (Promega) , 
(8 units/jig amplified DNA) and BamHI (Promega) , (10 
units//tg amplified DNA) at 37°C. The thus amplified and 
subsequently cleaved DNA-product was isolated by elec- 
trophoresis in a 2% (weight by volume) agarose gel 
20 (NuSieve agarose, FMC Biproducts) in a TAE-buffer (40 Mm 
Tris, 20 Mm Na-acetate, 2 Mm EDTA, Ph 8.0). The result- 
ing 930 base-pair large fragment was cut from the gel. 
The DNA concentration in this removed gel-piece was 
estimated to be 0.05 mg/ml. The agarose-piece containing 
25 the cleaved, amplified fragment was melted in a water 
bath at 65 °C, whereafter the fragment was allowed to 
cool to 37 »C. 10 Ml (0.5 Hq) of this DNA was transferred 
to a semimicrotube (Sarstedt) preheated to 37 °C, where- 
after 1 Ml of the vector pHD389 was immediately added 
30 and cleaved with Narl (Promega) and BamHI, 1 Ml 

lOxligas-buf f er (Promega and 1 Ml T4 DNA-ligase (Pro- 
mega? 1 unit/Ml). The ligating reaction was then used to 
transform E. coli . strain LE392, which had been compe- 
tent in accordance with the rubidium/ calcium-chloride- 
35 method as described by Kushner (1978) . Molecular biolog- 
ical standard methods have been used in the manipulation 
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of DNA (Sambrook, J.E. Fritsch and T. Maniatis, 1989. 
Molecular cloning: A laboratory manual. 2nd Ed. Cold 
Spring Harbor Laboratories, Cold Spring Harbor, New 
York, USA). The cleaving and ligating conditions recom- 
5 mended by the manufacturer of DNA-ligase and restriction 
enzymes have been followed in other respects. 

Expression system 

10 The vector pHD389 (see Figure 2) is a modified variant 
of the plasmid pHD313 (Dalboge, H.E. Bech Jensen, H. 
T6ttrup> A. Grubb, M. Abrahamson, I. Olafsson and 
S. Carlsen, 1989. High-level expression of active human 
cystatin C in Escherichia coli. Gene, 79: 325-332). The 

15 vector, which is replicated in E. coli (contains or i = 

origin of replication from plasmid pUC19) is constructed 
so that DNA-fragments which have been cloned into the 
cleaving site of Narl will be transcribed and translated 
downstream of and in the immediate vicinity of the 

20 signal peptide (21 amino acids) , from envelope-protein 
ompA from E. coli . • Translation will be initiated from 
the codon ATG which codes for the first amino acid 
(methionine) in the signal peptide. This construction 
permits the translated peptide to be transported to the 

25' periplasmic space in E. coli . This is advantageous, 

since it reduces the risk of degradation of the desired 
product of enzymes occurring intracellular ly in E. coli. 
Moreover, it is easier to purify peptides which have 
been exported to the periplasic space. Unique recog- 

30 nition sequences (multiple cloning sequences) for sever- 
al other restriction enzymes, among them ecoRX, Sail and 
BamHI are found immediately after the Narl cleaving 
site. An optimized so-called shine-Dalgarno-sequence 
(also called ribosomal binding site, RBS) is found seven 

35 nucleotides upstream from the ATG-codon in the signal 

sequence from ompA, this optimized sequence binding to a 
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complementary sequence in 16S rRNA in the ribosomes and 
* is responsible for the translation being initiated in 
the c rrect place. The transcription of such DNA as that 
which is co-transcribed with the signal sequence for 
5 ompA is controlled by the P_-promotor from coliphage X. 
The vector also contained the gene for cI857 from coli- 
phage X whose product down-regulates transcription from 
P (and whose product is expressed constitutively) . This 
C I3 57 -mediated down-regulation of transcription from P R 

10 is heat-sensitive. The transcription regulated from this 
promotor is terminated with the aid of a so-called rho- 
independent transcription terminating sequence (forms a 
structure in DNA't which results in the DNA-dependent 
RNA-polymerase leaving the DNA-strand) which is placed 

15 in the vector immediately downstream of the multiple 

cloning sequence* The plasmid also carries the B-lacta- 
mase gene (from the plasmid pUC19) whose product permits 
ampicillin-selection of E. coli clones that have been 
transformed by the vector. 

20 

Selection of protein L-producing clones 

The transformed bacteria are cultivated, or cultured, on 
culture plates with an LB-medium which also contained 

25 ampicillin in a concentration of 100 jig/ml. Cultivation 
of the bacteria progressed overnight at 30°C, whereafter 
the bacteria were transferred to an incubator where they 
were cultivated for a further 4 hours at 42 °C. The 
plates were kept in a refrigerator overnight. On the 

30 next day, the colonies were transferred to nitrocellu- 
lose filters. Filters and culture plates were marked so 
as to enable the transferred colonies to be readily 
identified on respective culture plates. The culture 
plates were again incubated overnight at 30 °C, so that 

35 remaining rests of transferred bacteria colonies could 
again gr w. The plates were then kept in a refrigerator. 
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The bacteria in the colonies on the nitr cellulose- 
impressions were lysed by incubating the filter in 10% 
SDS for 10 minutes. Filters c ntaining lysed bacteria 
were then rinsed with a blocking buffer which comprised 
5 PBS (pH 7.2) with 0.25% gelatine and 0.25% Tween-20 

(four baths, 250 ml each at 37 °C) , whereafter the filter 
was incubated with radioactively marked (marked with 
125 I in accordance with the chloramin-T-method) Ig-K- 
chains (20 ng/ml in PBS with 0.1% gelatine). The incuba- 

10 tion took place at room temperature over a period of 3 

hours, whereafter non-bound radioactively marked protein 
was rinsed-off with PBS (pH 7.2) containing 
• 0.5 M NaCl, 0.25% gelatine and 0.25% Tween-20 (four 
baths, 250 ml each at room temperature). All filters 

15 were exposed to X-ray film. Positive colonies were 

identified on the original culture plate. Clones which 
reacted with Ig-K-chains were selected and analyzed with 
respect to the size on the DNA-fragment introduced in K 
the vector. One of these clones was selected for the 

20 production of protein L, pHDL. The DNA't introduced from 
this clone into plasmid pHD389 was sequenced. The DNA- 
sequence was found to be in full agreement with corre- 
sponding sequences (B1-B4 and 21 bases in B5) in the 
gene for protein L from Peptostreptococc us maanus ; 

25 strain 312. The size and binding properties of the 

protein produced by clone pHDL was analyzed with the aid 
of SDS-PAGE (see Figure 8), dot-blot experiment (see 
Figure 9) and competitive binding experiments. 

30 Production of protein L 

Several colonies from a culture plate with E. coli pHDL 
were used to inoculate a preculture (LB-medium with an 
addition of 100 mg/1 ampicillin) , which was cultured at 
35 28 °C overnight. On the following morning, the preculture 
was transferred t a larger volume (100 times the volume 
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of the preculture) of fresh LB-medium containing ampi- 
cillin (100 mg/1) and was cultured in shake-flasks (200 
rpm), (or fermentors) at 28 °C The culture temperature 
was raised to 40°C (induction of transcription) when the 
5 absorbency value at 620 nm reached 0.5. Cultivation then 
continued for 4 hours (applied solely to cultivation in 
shake-flasks) . Upon completion of the cultivation pro- 
" " cess, the bacteria were centrifuged down. The bacteria 
were then lysed with an osmotic shock method at 4°C 

10 (Dalboge et al. , 1989 supra) . The lysate was adjusted to 
a pH = 7. Remaining bacteria rests were then centrifuged 
down, whereafter the supernatent was purified on IgG- 
sepharose in accordance with earlier described protocol 
for protein G and protein L (U. Sjbbring, L. Bjorck and 

15 W. Kastern. 1991. Streptococcal protein G: Gene struc- 
ture and protein binding properties. J. Biol. Chem. 266: 
399-405; W. Kastern, U. Sjbbring and L. Bjorck. 1992. 
Structure of peptostreptococcal protein L and identifi- 
cation of a repeated. immunoglobulin light chain-binding 

20 doman. J. Biol. Chem. in-print. 

The expression system gave about 20 mg/1 of protein L 
when cultivation in shake-flasks. The culture was depos- 
ited at DSSM, Identification Reference DSSM E. coli 
25 LE392/pHDL. 
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Example 2 

Cloning and expression of prot ein LG 

5 construction of oligonucleotides (primers) for amplify- 
ing sequences which code for protein L6 

Protein L 

10 It has been found that a protein L-peptide (expressed in 
E. coli ) constructed of the sequence, ala-val-glu-asn- 
domain Bl (from protein L) will bind to the light chains 
of the immunoglobulins (Kastern, Sjobring and Bjorck, 
1992 , ' J. Biol. Chem. in-print). Since the affinity of 

15 this simple domain to Ig is relatively low (1x10 M 

1 ) and since the naturally occurring protein L, which is 
comprised of several mutually similar domains (B1-B5) 
has a higher affinity to Ig (1 x 10 10 m" 1 ) , four of 
these domains have been expressed together in the fol- 

20 lowing way: 

PL-N and PL-C2 are synthetic oligonucleotides (manufac- 
tured at the Biomolecular Unit at Lund University (Swe- 
den) in accordance with Applicant's instructions) which 
25 were used, with the aid of PCR (Polymerase Chain Reac- 
tion) to amplify a clonable gene fragment, called Bl-4, 
which codes for four Ig-binding protein L domains (ala- 
val-glu-asn-Bl-B2-B3-B4-lys-lys-val-asp-glu-lys-pro-glu- 

glu) : 

30 

PL-N : 5 ' -GCTCAGGCGGCGCCGGTAGAAAATAAAGAAGAAACACCAGAAAC-3 ' 

valgluasnlysglugluthrproglu 

P1-C2 : 5 ' -CAGCAGCAGC CATGGG TTCTTCTGGTTTTTCGTCAACTTTCTTA- 
35 3' 
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Amino acids have been shown under corresponding triplets 
in the coded strand. DNA-fragments which have been 
amplified with the aid of PL-N contain the recognition 
sequence for the restriction enzyme Hpall immediately 
upstream of the triplet which codes for. the first amino 
acid (val) in the expressed protein L-fragment. The 
fragment that has been cleaved with Hpall can be ligated 
with DNA (in this case > the used expression vector „, ..... 
pHD389) which has been cleaved with Sari. The construc- 
tion results in translation of an extra amino acid (ala) 
immediately upstream of the first amino acid in the 
protein L-fragment. The DNA-fragment that has been 
amplified with the aid of PL-C2 will contain the recog- 
nition sequence for the restriction enzyme Mool (empha- 
15 sized) immediately downstream of the sequence which 

codes for the last amino acid in the expressed protein 
L-fragment (glu) . Amplified fragments which have been 
cleaved with Ncol can be ligated to the Ncol-cleaved , 
PCR-generated protein-asp-CDC-met-f ragment (see below) . 



10 



20 



Protein 6 

It is known that a simple C-domain from protein G will 
25 bind to IgG (B. Guss, M. Eliasson, A. Olsson, M. Uhlen, 
A.-K. Frej, H. Jornvall, I. Flock and M. Lindberg. 1986. 
Structure of the IgG-binding regions of streptococcal 
protein G. EBBO. J. 5: 1567-1575). The strength at which 
a simple C-domain binds to IgG is relatively low 
30 (5 x 10 7 m" 1 ) . A fragment which consists of two C-do- 
mains with an intermediate D-region having a length of 
15 amino acids, however, has a considerably higher 
affinity to IgG (1.x id 9 M _1 ) . CDC-N and CDC-C are 
oligonucleotides which have been used as PCR-primers to 
35 amplify a clonable DNA-fragment, designated CDC, which 
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codes for two IgG-binding protein G-domains (pro-met- 
asp-CDC-met) . 



5 



CDC-N : GGSSST5GACACTTACAAATTAATCCTTAATGGT 
metaspthrtyr lys leui le leuasngly 



CDC-C : CS55T55A CTTATTA CATTTCAGTTACCGTAAAGGTCTTAGT 



Amino acids in the resultant sequence have been shown 
10 beneath the primer of the coding strand. DNA-f ragments 
which have been amplified with the aid of CDC-N contain 
the recognition sequence for the restriction enzyme Ncol 
(marked with a line above the sequence) - Cleaved ampli- 
fied fragments can be ligated with the fragment that has 
15 been amplified with the aid of PL-C2 and then cleaved 

with Ncol. The fragment will therewith be translated to 
the correct reading frame. DNA-f ragments which have been 
amplified with the aid of CDC-C will contain two so- 
called stop condons (emphasized) which terminate trans- 
20 lation. The recognition sequence for the restriction 
enzyme Sail (marked with a line above the sequence) 
follows immediately afterwards, this sequence also being 
found in the expression vector pHD389 (see Figure 1) . 

25 Those sequences which code for the binding properties of 
protein L (B1-B5) and for protein G (CDC) respectively 
contain no internal recognition sequences for the re- 
striction enzymes Hpall, Sail or Ncol. 

30 Amplif if ication and cloning procedures 



PCR (Polymerase Chain Reaction) was carried out in 
accordance with a protocol described by Saiki et al. , 
,1988; PCR was carried out in a Hybaid Intelligent Heat- 
35 ing-block (Teddington, UK) : 100 ^1 of the reaction 

mixture contained 50 mM KC1, 10 mM Tris-HCl f pH 8.3, 1.5 
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mM MgCl 100 Mg/ml gelatine, 300 M M with respect to 
each of 2 the deoxynucleotides (dATP, dCTP, dGTP, dTTP) , 
(Pharmacia) . In order to amplify sequences which code 
for the light-chain binding parts of protein L, there 
5 were added 20 pmol of each of the oligonucleotides PL-N 
and PL-C2, and 10 Ml of a DNA-solution which contained 
0.1 mg/ml of chromosomal DNA from Peptostreotococcus 
magnus, strain 312. By way of an alternative/ 20 pmol 
were added to each of the oligonucleotide pairs CDC-N 
10 and CDC-C and 10 Ml of a DNA-solutioii which contained 

0.1 mg/ml of chromosomal DNA from a group C streptococ- 
cus strain ( Streptococcus eguisimilis) called C40 
(U. Sjobring, L. Bjorck and W. Kastern. 1991. Strepto-v 
coccai protein G: Gene structure and protein binding 
15 properties. J. Biol. Chem. 266: 399-405 or with Ncol and 
Sail (10 U/Mg PCR-product) ,- (for CDC) at 37 °C. The thus 
amplified and subseqently cleaved DNA- fragments were 
then separated by electrophoresis in a 2% (weight by 
volume) agrose gel (NuSieve agarose, FMC Bioproducts) in 
20 a TAE-buffer (40 mM Tris, 20 mM aNa-cetate f 2 mM EDTA, 

pH 8.0) . The resultant fragments, 930 bp (for Bl-4) 
and 390 bp (for CDC) were cut from the gel. The 
concentration of DNA in the thus separated gel pieces 
was estimated to be 0.05 mg/ml. The agarose pieces cut 
25 from the gel and containing the cleaved, amplified 

fragments (Bl-4 and CDC) were melted in a water bath at 
65°C r whereafter they were allowed to cool to 37°C. 
10 Ml (0.5 Mg) of th i s DNA were transferred to a semi- 
microtube (Sarstedt) f preheated to 37 «c, whereafter 1 Ml 
30 of the vector pHD389 which had been cleaved with Narl 
and Sail were added. 1 pi 10 x ligase buffer (Promega) 
and 1 Ml T4 DNA-ligase (1 unit/Ml) were also added. The 
ligating reaction was permitted to take place at 37 °C 
for 6 hours. The cleaving and ligating conditions recom- 
35 mended by the producer of DNA-ligase and restriction 

enzymes (Promega) were followed in other respects. The 
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ligating reaction was then used to transform E. coli, 
strain LE392, which had been made competent in accor- 
dance with the rubidium-chloride/calcium-dichloride 
method as described by Kushner (1978) . Manipulation of 
5 DNA was effected in accordance with molecular biological 
standard methods (Sambrook et al., 1989). 

Expression system 

10 The vector pHD389 (see Figure 2) is a modified variant 

of the plasmid pHD313 (Dalboge et al., 1989). The vector 
which was replicated in E. coli (contains origin of 
replication from plasmid pUC19) is constructed such that 
. DNA-f ragments which have been cloned in the cleaving 

15 site for Narl will be expressed immediately after, or 

downstream, of the signal peptide (21 amino acids) from 
the envelope protein ompA from E. coli . Translation will 
be initiated from the ATG-codon which codes for the 
first amino acid (methionine) in the signal peptide. The 

20 construction with an E. coli -individual signal sequence 
which precedes the desired peptide enables the translat- 
ed peptide to be transported to the periplasmic space in 
E. coli . This is beneficial since it reduces the risk of 
degradation of the desired product through the intracel- 

25 lular occurrent enzymes of E. coli . Furthermore, it is 
easier to purify peptides which have been exported to 
the periplasmatic space. Unique recognition sequences 
(multiple cloning sequences) for several other restric- 
tion enzymes, among them EcoRI, Sail and BamHI are 

30 present immediately downstream of the Narl cleaving 
site. An optimized so-called Shine-Dalgarno sequence 
(also called ribosomal binding site, RBS) is found seven 
nucleotides upstream of the ATG-codon in the signal 
sequence from ompA, this optimized Shbine-Dalgarno 

35 sequence binding to a complementary sequence in 16S rRNA 
in the ribosomes and in a manner to decide that the 
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translati n is initiated in the c rrect place. The 
transcription of such DNA as that which is co-trans- 
cribed with the signal sequence for mpA is controlled 
by the P R -promotor from coliphage X. The vector also 
5 contains the gene for c.1857 from coliphage X, the prod- 
uct of which regulates-dpwn transcription from P R and 
the product of which is expressed constitutive ly. This 
cI857-mediated down-regulation of transcription from P R 
is heat-sensitive. Transcription which is regulated, or 

10 controlled f from this promotor will be terminated with 
the aid of a so-called rho-independent transcription 
terminating sequence which is inserted in the vector 
immediately downstream of the multiple cloning site. The 
plasmid also carries the gene for fi-lactamase (from the 

15 plasmid pUC19 ) , the product of which permits ampicillin- 
selection of E. coli clones that have been transformed 
with the vector. 

Selection of protein LG-produced clones 

20 

The transformed bacteria are cultivated on culture 
plates with LB-medium which also contained ampicillin in 
a concentration of 100 pg/ml. The bacteria were culti- 
vated overnight at 30°C, whereafter they were trans - 

25 f erred to a cultivation cabinet (42 °C) and cultured for 
a further four (4) hours. The plates were stored in a 
refrigerator overnight. On the following day, the colo- 
nies were transferred to nitrocellulose filters. The 
filters and culture plates were marked, so that the 

30 transferred colonies could later be identified on the 
culture plate. The culture plates were again incubated 
overnight at 30°C, so that rests of transferred bacteria 
colonies remaining on the plates could again grow. The 
plates were then stored in a refrigerator. The filter 

35 was incubated in 10% SDS f or 10 minutes, so as to lyse 
the bacteria in the c lonies on the nitrocellulose 
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rinsed with a blocking buffer c nsisting of PBS (pH 7.2) 

with 0.25% gelatine and 0.25% Tween-20 (four baths of 

250 ml at 37°C), whereafter the filter was incubated 

125 

5 with radioactively (marked with I according to the 
chloromine-T-method) marked Ig-*-chains (20 ng/ml) in 
PBS with 0.1% gelatine). The incubation process took 
place at room temperature for four (4) hours, whereafter 
non-bound radioactively marked protein was rinsed-of f 

10 with PBS (pH 7.2) containing 0.5 M NaCl, 0.25% gelatine 
and 0.25% Tween-20 (four baths, 250 ml each at room tem- 
perature) . All filters were exposed to X-ray film. 
Positive colonies on the original culture plate were 
identified. A number of positive colonies were re- 

15 cultivated on new plates and new colony-blot experiments 
were carried out with these plates as a starting materi- 
al with the intention of identifying E. coli colonies 
which bind IgG Fc. These tests were carried out in 
precisely the same manner as that described above with 

20 respect to the identification of E. coli -colonies which 
expressed Ig light-chain-binding protein, with the 
exception that a radioactively marked ( I) IgG Fc (20 
ng/ml) was used as a probe. Clones which reacted with 
both proteins were selected and analyzed with regard to 

25 the size of the DNA-fragment introduced in the vector. 

One of these clones was chosen for production of protein 
LG, pHDLG. The DNA't taken from this clone and intro- 
duced into plasmid pHD389 was sequenced. The DNA-se- 
quence exhibited full agreement with corresponding 

30 sequences (B1-B4 and 21 bases in B5) in the gene for 
protein L from Peptostreotococcus maanus , strain 312, 
and with C1DC2 sequence in group C streptococcus strain 
C40. The size and binding properties of the protein 
produced from clone pHDLG was analyzed with the aid of 

35 SDS-PAGE (see Figure 8), dot-blot experiment (see Figure 
10) and competitive binding experiments. 
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Pr ducti n f protein LG 

Several colonies from a culture plate with E. c li pHDLG 
were used to inoculate a preculture (LB -medium with an 
5 addition of 100 mg/1 ampicillin) were cultivated at 28°C 
overnight. In the morning, the preculture was trans^ 
f erred to a larger volume (100 times the volume of the 
preculture) of fresh LB-medium containing ampicillin 
(100 mg/1) and was cultivated in vibrating flasks (200 

10 rpm) , (or fermenters) at 28 °C. When an absorbence value 
of 0.5 was reached at 620 ran, the cultivation tempera- 
ture was raised to 40°C (induction of transcription) . 
The cultivation process was then continued for 4 hours 
..(applies only to cultivation in vibrated flasks). The 

15 bacteria were centrifuged down upon termination of the 

cultivation process. The bacteria were then lysed at 4°C 
in accordance with an osmotic shock method (Dalboge et 
al. , a989) . The lysate was adjusted to a pH of 7. Re- 
maining bacteria rests were centrifuged down and the 

20 supernatent then purified on IgG-sepharose, in accor- 
dance with the protocol earlier described with reference 
to protein G and protein L. (Sjobring et al. f 199i, 
Kastern et al. , 1992). 

25 The expression system gave about 30 mg/1 of protein LG 
when cultivation in vibrated flasks. A deposition has 
been made at DSStt, Identification Reference DSSM E. coli 
LE392/pHDLG. 

30 Example 3 

Analysis of the binding properties of protein LG 

Western Blot 

35 

Protein G (the ClDC2-fragment) 9 protein L (f ur B- 
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domains) and protein LG were isolated with SDS-PAGE (10% 
acrylamide c ncentration) . The isolated proteins were 
transfered to nitrocellulose membranes in three similar 
copies (triplicate) . Each of these membranes was incu- 
5 bated with radioactively marked proteins (20 ng/ml: one 
of the membrane-copies was incubated with human poly- 
clonal IgG, another with human IgG Fc-fragment and the 
third with isolated human IgG xchains. Non-bound radio- 
actively marked proteins were rinsed off and all filters 
10 were then exposed to X-ray film. 

Slot-blot 

Human polyclonal Ig-preparations and Ig-f ragments were 
applied with the aid of a slot-blot appliances on nitro- 
cellulose filters in given quantities (see Figure 10) on 
three similar copies. Each of these membranes was incu- 
bated with radioactively marked proteins (20 ng/ml). One 
of the membrane copies was incubated with protein LG, 
another with protein L and the third with protein G. 
Non-bound radioactively marked proteins were rinsed-off 
and all' filters were then exposed to X-ray film. 

The results are shown in Figures 9 and 10. 
25 

Other binding experiments have been carried out, with 
the following results: 
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TABLE 

Binding of the proteins G, L and LG to immunoglobulins. 



Binding protein: G K L K LG K 

a a 

Immunoglobulin 



Polyclonal igG* + 67 (10) +9.0 + 20 



IgG subclasses 



+ 2.0 + + 



4 

IgG fragment 



IgG 1 + 3.1 + * .+ 

IgG, + 6.1 + + 
3 

IgG + 4.7 + + 



Fc* + 6.0 (0.5) - 

F(ab') 2 * + 0- 4 CO- 2 ) + . + 



kappa 



+ 1.5 < + 



lambda - (") 



+ 11.6 .+ 
+10.4 + 

+ + 



Other Ig-classes 
IgM 
IgA 

IgE - 
IgD 

Oth^r ^pecjes: 
Polyclonal 

Monkey + + 

Rabbit IgG + 70 + 0-074 + 

IgG-Fc + 3.0 - + 

IgG-F(ab') 2 + 0.44 + 

Mouse + 41 + 2.6 + 

Rat +1.5 + 0.39 + 

Goat +14 ~ + 



+ 
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TABLE (cont'd. ) 

Binding of the proteins G, L and LG to immunoglobulins . 



Binding protein: G K L K LG K 

a a a 

Immunoglobulin 



Bovine IgC^ + 3 - + 

IgG 2 + 2 - + 

Horse + - + 

Guinea Pig + ■ + + 

Sheep + - + 

Dog + - + 

Pig + + + 

Hamster + 
Cat 

Hen - - 

Monclonals 

Mouse 

IgG, + , + \ + 

1 

IgG^ + + + 

2a 

lgc 2b + + 

lgG 3 + + 

igM - + +. 

igA - + + 

Rat 

IgG 2a + + 

IgG 2b 

IgG 2c 
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K = affinity constant (M _1 ). *The numerals within parenthesis 
"disclose the affinity of a recombinant protein G comprised of tw 
igG-binding domains. # A weak bond to lambda chains exists. 
& Binding to PI and PLG depends on the type of light chain of Ig. 

It will thus be seen that the synthesized hybrid protein LG has 
broad binding activity/specificity. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: HighTech Receptro AB 

(B) STREET: c/o Active, Skeppsbron 2 

(C) CITY: MALMO 

(E) COUNTRY: SWEDEN 

(F) POSTAL CODE (ZIP) : 211 20 

(G) TELEPHONE: 040/35 07 00 

(H) TELEFAX: 040/ 23 74 05 

(I) TELEX: 32637 Active S 

(ii) TITLE OF INVENTION: Hybridprotein 
(iii) NUMBER OF SEQUENCES: 1 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

. (C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0 f Version #1.25 



(EPO) 



(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: SE PCT/SE93/00375 
( vi ) PRIOR APPLI CATION DATA : 

(A) APPLICATION NUMBER: SE 9201331-7 

(B) FILING DATE: 28-APR-1992 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE392/pHDL, DSM 7054 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Ala Val Glu Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser 
1 5 10 15 

Glu Glu Glu Val Thr lie Lys Ala Asn Leu lie Phe Ala Asn Gly Ser 
20 25 30 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu 
35 40 45 , 



SUBSTITUTE SHEET 



WO 93/22342 PCT/SE93/00375 

36 

Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr 
50 55 60 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 
65 70 75 80 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr He Lys Ala 
85 90 95 

Asn Leu He Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
100 105 HO 

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu 
115 120 125 

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr 
130 135 140 

Thr Leu Asn He Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro 
145 150 155 160 

Lys Glu Glu Val Thr lie Lys Ala Asn Leu He Tyr Ala Asp Gly Lys 
165 170 , 175 * 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu 
180 185 190 

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr 
195 200 205 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 
210 215 220 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr He Lys Ala 
225 230 235 240 

Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
245 250 255 

Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu 
260 265 270 

Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr 
275 280 285 

Thr lie Asn He Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pro Glu 
290 295 300 

Glu 
305 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE392/pHDL, DSM 7054 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



GCGGTAGAAA 


ATAAAGAAGA 


AACACCAGAA 


ACACCAGAAA 


CTGATTCAGA 


50 


AGAAGAAGTA 


ACAATCAAAG 


CTAACCTAAT 


CTTTGCAAAT 


GGAAGCACAC 


100 


AAACTGCAGA 


ATTCAAAGGA 


ACATTTGAAA 


AAGCAACATC 


AGAAGCTTAT 


150 


GCGTATGCAG 


ATACTTTGAA 


GAAAGACAAT 


GGAGAATATA 


CTGTAGATGT 


200 


TGCAGATAAA 


GGTTATACTT 


TAAATATTAA 


ATTTGCTGGA 


AAAGAAAAAA 


250 


CACCAGAAGA 


ACCAAAAGAA 


GAAGTTACTA 


TTAAAGCAAA 


CTTAATCTAT 


300 


GCAGATGGAA 


AAACACAAAC 


AGCAGAATTC 


AAAGGAACAT 


TTGAAGAAGC 


350 


AACAGCAGAA 


GCATACAGAT 


ATGCAGATGC 


ATTAAAGAAG 


GACAATGGAG 


400 


AATATACAGT 


AGACGTTGCA 


GATAAAGGTT 


ATACTTTAAA 


TATTAAATTT 


450 


GCTGGAAAAG 


AAAAAACACC 


AGAAGAACCA 


AAAGAAGAAG 


TTACTATTAA 


500 


AGCAAACTTA 


ATCTATGCAG 


ATGGAAAAAC 


ACAAACAGCA 


GAATTCAAAG 


550 


GAACATTTGA 


AGAAGCAACA 


GCAGAAGCAT 


ACAGATATGC 


TGACTTATTA 


600 


GCAAAAGAAA 


ATGGTAAATA 


TACAGTAGAC 


GTTGCAGATA 


AAGGTTATAC 


650 


TTTAAATATT 


AAATTTGCTG 


GAAAAGAAAA 


AACACCAGAA 


GAACCAAAAG 


700 


AAGAAGTTAC 


TATTAAAGCA 


AACTTAATCT 


ATGCAGATGG 


AAAAACTCAA 


750 


ACAGCAGAGT 


TCAAAGGAAC 


ATTTGCAGAA 


GCAACAGCAG 


AAGCATACAG 


800 


ATACGCTGAC 


TTATTAGCAA 


AAGAAAATGG 


TAAATATACA 


GCAGACTTAG 


850 


AAGATGGTGG 


ATACACTATT 


AATATTAGAT 


TTGCAGGTAA 


GAAAGTTGAC 


900 


GAAAAACCAG 


AAGAATAATA 


A 






921 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unkn wn 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 

(yi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE3 9 2 / pHDLG , DSM 7055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Val Glii Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser 
1 5 10 15 

Glu Glii Glu Val Thr lie Lys Ala Asn Leu lie Phe Ala Asn Gly Ser 

, . 20 ' ' ] 25 " " 310 " • 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu 
35 40 45 

Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr 
50 55 60 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 
65 70 75 80 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr lie Lys Ala 
85 90 95 

Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
100 105 110 

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu 
115 120 125 

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr 
130 135 140 

Thr Leu Asn lie Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro 
145 150 155 160 

Lys Glu Glu Val Thr lie Lys Ala Asn Leu lie Tyr Ala Asp Gly Lys 
165 170 175 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu 
180 185 190 

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr 
195 200 205 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 
210 215 220 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr lie Lys Ala 
225 230 235 240 

Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
245 250 255 

Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu 
260 265 270 
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Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr 
275 280 285 

Thr lie Asn lie Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pr Glu 
290 295 300 

Glu Pro Met Asp Thr Tyr Lys Leu lie Leu Asn Gly Lys Thr Leu Lys 
305 310 315 320 

Gly Glu Thr Thr Thr Glu Ala Val Asp Ala Ala Thr Ala Glu Lys Val 
325 330 335 

Phe Lys Gin Tyr Ala Asn Asp Asn Gly Val Asp Gly Glu Trp Thr Tyr 

340 345 350 .. 

Asp Asp Ala Thr Lys Thr Phe Thr Val Thr Glu Lys Pro Glu Val lie 
355 360 365 

Asp Ala Ser Glu Leu Thr Pro Ala Val Thr Thr Tyr Lys Leu Val lie 
370 375 380 

Asn Gly Lys Thr Leu Lys Gly Glu Thr Thr Thr Lys Ala Val Asp Ala 
385 390 • 395 400 

Glu Thr Ala Glu Lys Ala Phe Lys Gin Tyr Ala Asn Asp Asn Gly Val 
405 410 415 

Asp Gly Val Trp Thr Tyr Asp Asp Ala Thr Lys Thr Phe Thr Val Thr 
420 425 430 

Glu Met 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1308 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichis coli L392/pHDLG, DSM 7055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GCGGTAGAAA ATAAAGAAGA AACACCAGAA ACACCAGAAA CTGATTCAGA 50 

AGAAGAAGTA ACAATCAAAG CTAACCTAAT CTTTGCAAAT GGAAGCACAC 100 

AAACTGCAGA ATTCAAAGGA ACATTTGAAA AAGCAACATC AGAAGCTTAT 150 

GCGTATGCAG ATACTTTGAA GAAAGACAAT GGAGAATATA CTGTAGATGT 200 
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rTl/"» TV /"• 7V fTlTV TV TV 

TGCAGATAAA 


GGTTATACTT 


TAAATATTAA ATTTGCTGGA 


7V7\TV^7\7\T\TVTVTV 

AAAGAAAAAA 


OR A 


«TV /"^/"^TV t\ T\ ^* TV 

CACCAGAAGA 


ACCAAAAGAA 


GAAGTTACTA TTAAAGCAAA 


CTTAATCTAT 


*3 ft ft 

JUL) 


G CAG A 1 GG AA 


AAACACAAAC AGCAGAATTC AAAGGAXCAT 




Ten 


TV Tv^TN/""»/'"»7v/"»7V 7a 

AACAGCAGAA 


GCATACAGAT 


ATGCAGATGC 


ATTAAAGAAG 


bALAA x GG AG 


jinn 


AA1 A 1 ACAGT 


AGACGTTGCA 


GATAAAGGTT 


ATACTTTAAA 


t\ ^n^n tv t\ tv ^n^t^n 
1A1 1AAA1 11 


>i e n 


/^/■Vfl/y*7i TV TA. TV/* 

GCTGGAAAAG 


AAAAAACACC AGAAGAACCA AAAGAAGAAG 


^n^n 7v /'*fii 71 ^n^p 7v 7k 
11AL1A11AA 


DUU 


7v /^/itv t\ t\ nnwnik 

AGCAAACTTA 


ATCTATGCAG ATGGAAAAAC 


ACAAACAGCA 


f* H TV rnrnr* tv 71 » r» 
G AA 11 C AAAG 


ccn 
jjU 


GAACATTTGA 


AGAAGCAACA 


GCAGAAGCAT 


ACAGATATGC 


1 bALX 1A1 1A 


OUU 


/"»/■» 71 * 71 71 ^» 71 7i 71 

GuAAAAGAAA 


ATGGTAAATA 


TACAGTAGAC 


GTTGCAGATA 


71 7\ f* fprp 7V T 71 /"» 

AAbbl 1A1AL 


ODU 


1 1 1AAA1A1 1 


AAATTTGCTG 


GAAAAGAAAA 


AACACCAGAA 


O A TV 0/^7A 7A TV 7A 

bAALLAAAAb 


/ UU 


AAGAAG X 1AL 


TATTAAAGCA 


AACTTAATCT 


ATGCAGATGG 


AAAAACU.GAA 


/DU 


7A /17A /"»/-it\ /"•TV /rn 

Awib GAGAG 1 


TCAAAGGAAC 


ATTTGCAGAA 


GCAACAGCAG 


AAGCA1ACAG 


oUU 


A 1 AU3t iGAC 


TTATTAGCAA 


AAGAAAATGG 


TAAATATACA 


f+ O 7V ^* TV /KIWI TV /*' 

GCAGACT1 1AG 




AAGATGGTGG 


ATACACTATT 


AATATTAGAT 


TTGCAGGTAA 


TV IV IV STTim^* TV 

GAAAGTTGAC 


900 


OTA 71 71 7V 7\ /"'/"'TV 

GAAAAACCAG 


AAGAACCCAT 


GGACACTTAC 


AAATTAATCC 


nrimTV tv m/^^fTTTv tv 

TTAATGGTAA 


ft c 

950 


AAt-Al 1 GAAA 


GGCGAAACAA 


CTACTGAAGC 


TGTTGATGCT 


GCTACTGCAG 


^ /\/\rv 

1000 


71 71 7k 7v 71 ^*^R^i^n^n 


CAAACAATAC 


GCTAACGACA 


ACGGTGTTGA 


CGGTGAA1GG 


1050 


TV / »f IMII TV O/™* A f*f* 


ATGCGACTAA 


GACCTTTACA 


GTTACTGAAA 


AAUUAiaAAG 1 


11UU 


T^V 71 /"V* 
utilLuAlbUu 


TCTGAATTAA 


CACCAGCCGT 


GACAACTTAC 


AAAGl 1 Gl 1 A 


lien 


TTAATGGTAA 


AACATTGAAA GGCGAAACAA CTACTAAAGC 


AGTAGACGCA 


1200 


GAAACTGCAG 


AAAAAGCCTT 


CAAACAATAC 


GCTAACGACA 


ACGGTGTTGA 


1250 


TGGTGTTTGG 


ACTTATGATG 


ATGCGACTAA 


GACCTTTACG 


GTAACTGAAA 


1300 


TGTAATAA 










1308 
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Claims 

1. Protein L having the ability to bind t the light 
chains of immunoglobulins, characterized 
5 in that the protein L has the following amino acid 
sequence : 

. — B1 

Ala Val Glu Asn j Lys Glu Glu Thr Pro Glu thr Pro Glu Thr Asp Ser 
I | 5 10 15 

Glu Glu Glu val Thr lie Lys Ala Aan Leu He Phe Ala Asn Gly Ser 
20 25 30 

10 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu 
• 35 40 45 

Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr 
50 55 60 

15 Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn He Lys Phe Ala Gly 
65 B2 70 75 80 

iLys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr He Lys Ala 
! 85 90 95 

Asn Leu He Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
20 100 105 11° 

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu 
115 120 125 

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr 
130 135 1*0 

25 ■ 83 

Thr Leu Asn He Lys Phe Ala Gly ; Lys Glu Lys Thr Pro Glu Glu Pro 
145 150 ' 155 

Lys Glu Glu Val Thr He Lys Ala Asn Leu He Tyr Ala Asp Gly Lys 
165 170 I 75 

2 0 Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu 
180 185 19° 

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr 
195 200 205 

Val Asp val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 

:3 ?jo 215 220 



SUBSTITUTE SHEET 



WO 93/22342 PCT/SE93/00375 

42 



10 



15 



20 



B4 



Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr lie Lys Ala 
225 230 235 

Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 



245 



250 



Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu 

: ... 260/ ; . ™* .„ 270 

Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr 
275 280 — B5 285 

Thr lie Asn lie Arg Phe Ala Gly Us Lys Val Asp Glu Lys Pro Glu 
290 295 300 

Glu 



and variants, subf raginents , multiples or mixtures of the 
domains B1-B5 having the same binding properties. 

2. DNA-sequence, characterized in that 
it codes for the protein according to Claim 1 and has 
the following nucleotide sequence: 





GCG GTA GAA AAT AAA GAA 


GAA 


ACA 


CCA 


GAA 


ACA 


CCA 


GAA ACT 


GAT 


TCA 


^8 


25 


GAA 


GAA 


GAA 


GTA 


ACA ATC 


AAA 


GCT 


AAC 


CTA 


ATC 


TTT 


GCA AAT 


GGA 


AGC 


.96 




ACA 


CAA 


ACT 


GCA 


GAA TTC 


AAA 


GGA ACA 


TTT 


GAA 


AAA 


GCA ACA 


TCA 


GAA 






GCT 


TAT 


GCG 


TAT 


GCA GAT 


ACT 


TTG r AAG AAA GAC AAT 


GGA GAA 


TAT 


ACT 


192 


30 


GTA 


GAT 


GTT 


GCA 


GAT AAA 


GGT 


TAT 


ACT 


TTA 


AAT 


ATT 


AAA TTT 


GCT 


GGA 


240 


AAA GAA AAA ACA CCA GAA 


GAA 


CCA 


AAA 


GAA 


GAA 


GTT 


ACT ATT 


AAA 


GCA 


288 




AAC 


TTA 


ATC 


TAT 


GCA GAT 


GGA 


AAA 


ACA 


CAA 


ACA 


GCA 


GAA TTC 


AAA 


GGA 


336 




ACA 


TTT 


GAA 


GAA 


GCA ACA 


GCA GAA 


GCA 


TAC 


AGA 


TAT 


GCA GAT 


GCA 


TTA 


384 .* 


35 


AAG 


AAG 


GAC 


AAT 


GGA GAA 


TAT 


ACA 


GTA 


GAC 


GTT 


GCA 


GAT AAA 


GGT 


TAT 


432 

* 




ACT 


TTA 


AAT 


ATT 


AAA TTT 


GCT 


GGA 


AAA 


GAA 


AAA 


ACA 


CCA GAA 


GAA 


CCA 


480 




AAA 


GAA 


GAA 


GTT 


ACT ATT 


AAA 


GCA 


AAC 


TTA 


ATC 


TAT 


GCA GAT 


GGA 


AAA 


528 
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CAA 


ACA 


GCA 


GAA 


TTC 


AAA 


GGA 


ACA 


TTT 


GAA 


GAA 


GCA 


ACA 


GCA 


GAA 


5/6 




GCA 


TAG 


AGA 


TAT 


GCT 


GAC 


TTA 


TTA 


GCA 


AAA 


GAA 


AAT 


GGT 


AAA 


TAT 


ACA 




5 


GTA 


GAC 


GTT 


GCA 


GAT 


AAA 


GGT 


TAT 


ACT 


TTA 


AAT 


ATT 


AAA 


TTT 


GCT 


GGA 


£ *7 0 

671 




AAA 


GAA 


AAA 


ACA 


CCA 


GAA 


GAA 


CCA 


AAA 


GAA 


GAA 


/■rum 


AL1 


ATT 


AAA 


GCA 






AAC 


TTA 


ATC 


TAT 


GCA 


GAT 


GGA 


AAA 


ACT 


CAA 


ACA 


GCA 


GAG 


TTC 


AAA 


GGA 


768 


10 


ACA 


TTT 


GCA 


GAA 


GCA 


ACA 


GCA 


GAA 


GCA 


TAC 


AGA 


TAC 


GCT 


GAC 


TTA 


TTA 


816 






AAA 


GAA 


AAT 


GGT 


AAA 


TAT 


ACA 


GCA 


GAC 


TTA 


GAA 


GAT 


GGT 


GGA TAC 


864 




ACT 


ATT 


AAT 


ATT 


AGA 


TTT 


GCA 


GGT 


AAG 


AAA 


GTT 


GAC 


GAA 


AAA 


CCA 


GAA 


912 


15 


GAA 


TAATAA 




























921 






3. 


A hybrid protein, 


c h 


a r 


a c 


t e 


r i 


z e 


d 


in 







that it includes one or more of the Bl-B5-domains ac- 
20 cording to Claim 1 which bind to the light chains in 

immunoglobulins of all classes, and domains which bind 
to heavy chains in immunoglobulin G. ^ 

4. A hybrid protein according to Claim 3, c h a r - 
25 acterized in that the domains which bind to 

heavy chains in immunoglobulin G are chosen from among 
the Cl- and C2-dbmains in protein G or from among any 
other functionally similar proteins which bind to heavy 
chains in immunoglobulin G f and variants, subfragments, 
30 multiples or mixtures thereof having the same binding 
properties. 

5. A hybrid protein according to Claim 4, c h a r - 
,acterized in that the hybrid protein has the 

35 following amino acid sequence: 
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-'a Vai Glu Asn ivs Glu Glu Thr Pro Glu thr Pro Glu Thr Asp Ser 

. ... " 5 io 15 

5 c-lu Glu Glu Val Ihr He Lys Ala Asn Leu lie Phe Ala Asn Gly Ser 

20 25 30 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu. 

35 " ; r " ' 40 •■ - - " ■ ' 45 - •-■ 

Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr 
10 go 55 60 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 



65 70 



75 



Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr He Lys Ala 



15 



85 90 9 5 



Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 

100 105 110 

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu 
115 120 125 

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr 
130 135 140 

Thr Leu Asn lie Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro 
145 150 155 160 

Lys Glu Glu Val Thr lie Lys Ala Asn Leu He Tyr Ala Asp Gly Lys 
25 165 170 175 



20 



Thr 



Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu 



180 185 

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr 
195 200 205 

30 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 



190 



210 215 



220 



Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr lie Lys Ala 
225 230 235 

35 Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 

245 . 1 250 " D 
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Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu 
260 265 270 

Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr 
275 " 2S0 285 

Thr lie Asn He Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pro Glu 
250 295 300 



10 



15 



20 



25 



:-lu ?ro Met Asp Thr 7vr Lys Leu lie Leu Asr. Sly Lys Thr Leu Lys 

" 315 



3 05 



310 



Gly Glu Thr Thr Thr Glu Ala Val Asp Ala Ala Thr Ala Glu Lys Val 
325 330 335 

.Phe Lys Gin Tyr Ala Asn Asp Asn Gly Val Asp Gly Glu Trp Thr Tyr 
340 345 350 

Asp Asp Ala Thr Lys Thr Phe Thr Val Thr Glu Lys Pro Glu Val lie 
355 360 365 

Asp Ala Ser Glu Leu Thr Pro Ala Val Thr Thr Tyr Lys Leu Val 
370 375 380 

Asn Gly Lys Thr Leu Lys Gly Glu Thr Thr Thr Lys Ala Val Asp Ala 

395 400 



385 



390 



Glu Thr Ala Glu Lys Ala Phe Lys Gin Tyr Ala Asn Asp Asn Gly val • 

405 410 415 

Asp Gly Val Trp Thr Tyr Asp Asp Ala Thr Lys Thr Phe Thr Val Thr„ 
420 425 430 



Glu Met 



30 and variants, subfragments, multiples or mixtures of the 
domains B1-B5 having the same binding properties. 

6. DNA-sequence, characterized in that 
it codes for a protein according to Claim 5 and has the 
35 following nucleotide sequence: 
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GCG 


GTA 


GAA 


AAT 


AAA 


GAA 


GAA 


ACA 


CCA 


GAA ACA 


CCA 


GAA ACT 


GAT TCA 


48 


/ ♦ 




GAA 


GAA 


GAA 


GTA ACA 


ATC AAA GCT 


AAC 


CTA ATC 


TTT 


GCA AAT GGA AGC 


96 






ACA 


CAA 


ACT 


GCA 


GAA 


TTC 


AAA 


GGA 


ACA 


TTT GAA 


AAA 


GCA ACA 


TCA GAA 








GCT 


TAT 


GCG 


TAT 


GCA 


GAT 


ACT 


TTG 


AAG 


AAA GAC AAT 


GGA GAA 


m % m Ik. s*f 1 % 

TAT ACT 


*92 - 




5 


GTA 


GAT 


GTT 


GCA 


GAT 


AAA 


GGT 


TAT 


ACT 


TTA AAT 


ATT 


AAA TTT 


GCT GGA 


240 






AAA 


GAA 


AAA 


ACA 


CCA 


GAA 


GAA 


CCA 


AAA 


GAA GAA 


GTT 


ACT ATT 


AAA GCA 


288 ■ 






AAC 


TTA 


ATC 


TAT 


GCA 


GAT 


GGA 


AAA 


ACA 


CAA ACA 


GCA 


GAA TTC 


AAA GGA 


336 




10 


ACA 


TTT 


GAA 


GAA 


GCA 


ACA 


GCA 


GAA 


GCA 


TAC AGA 


TAT 


GCA GAT 


GCA TTA 


384 






AAG 


AAG 


GAC 


AAT 


GGA 


GAA 


TAT 


ACA 


GTA 


GAC GTT 


GCA 


GAT AAA 


GGT TAT 


432 






ACT 


TTA 


AAT 


ATT 


AAA 


TTT 


GCT 


GGA 


AAA 


GAA AAA 


ACA 


CCA GAA 


GAA CCA 


480 






AAA 


GAA 


GAA 


GTT 


ACT 


ATT 


AAA 


GCA 


AAC 


TTA ATC 


TAT 


GCA GAT 


GGA AAA 


523 . 




15 


ACA 


CAA 


ACA 


GCA 


GAA 


TTC 


AAA 


GGA 


ACA 


TTT GAA 


GAA 


GCA ACA 


GCA GAA 


576. 






GCA 


TAC 


AGA 


TAT 


GCT 


GAC 


TTA 


TTA 


GCA 


AAA GAA 


AAT 


GGT AAA 


TAT ACA 


624 






GTA 


GAC 


GTT 


GCA 


GAT 


AAA 


GGT TAT 


ACT 


TTA AAT 


ATT 


AAA TTT 


GCT GGA 


672 




20 


AAA 


GAA 


AAA 


ACA CCA 


GAA 


GAA 


CCA 


AAA 


GAA GAA 


GTT 


ACT ATT 


AAA GCA 


720 


■ : ' ■■ 




AAC 


TTA 


ATC 


TAT 


GCA 


GAT 


GGA 


AAA 


ACT 


CAA ACA 


GCA 


GAG TTC 


AAA GGA- 


768.;: : . 






ACA 


TTT 


GCA 


GAA 


GCA 


ACA 


GCA 


GAA 


GCA 


TAG AGA TAC 


GCT GAC 


TTA TTA 


816 




25 


GCA 


AAA 


GAA 


AAT 


GGT 


AAA 


TAT 


ACA 


GCA 


GAC TTA GAA GAT GGT GG& TAC 


864 ' 




ACT 


ATT 


AAT 


ATT 


AGA 


TTT 


GCA 


GGT 


AAG 


AAA GTT 


GAC 


GAA AAA 


CCA GAA 


912 


..■ . 




GAA 


CCC 


ATG 


GAC 


ACT 


TAC 


AAA 


TTA 


ATC 


CTT AAT 


GGT 


AAA ACA 


TTG AAA 


960 


' - : : „ 




GGC 


GAA 


ACA 


ACT 


ACT 


GAA 


GCT 


GTT 


GAT 


GCT GCT 


ACT 


GCA GAA 


AAA GTC 


1008 




30 


TTC 


AAA 


CAA 


TAC 


GCT 


AAC 


GAC 


AAC 


GGT 


GTT GAC 


GGT 


GAA TGG 


ACT TAC 


1056 






GAC 


GAT 


GCG 


ACT 


AAG 


ACC 


TTT 


ACA 


GTT 


ACT GAA 


AAA 


CCA GAA GTG ATC 


1104 






GAT 


GCG 


TCT 


GAA 


TTA 


ACA 


CCA 


GCC 


GTG 


ACA ACT 


TAC 


AAA CTT GTT ATT 


1152 






AAT 


GGT 


AAA 


ACA 


TTG 


AAA 


GGC 


GAA 


ACA 


ACT ACT 


AAA 


GCA GTA GAC GCA 


1 ZUU 




GAA 


ACT 


GCA 


GAA 


AAA 


GCC 


TTC 


AAA 


CAA 


TAC GCT 


AAC 


GAC AAC GGT GTT 


1248 






GAT 


GGT 


GTT 


TGG 


ACT 


TAT 


GAT 


GAT 


GCG 


ACT AAG 


ACC 


TTT ACG GTA ACT 


1296 






GAA 


ATG 


TAATAA 




















1308 





SUBSTITUTE SHEET 



WO 93/22342 



PCT/SE93/00375 



47 

7. DNA-sequence, characterized in that 
it codes for a protein according to Claim 3 and 4. 

8. DNA-seguehce, characterized in that 
5 it hybridizes to the DNA-sequence of Claim 2, 6 or 7 

under conventional conditions and codes for a protein 
which has the same binding properties as the protein 
according to any one of Claims 1 and 3-5. 

10 9. A plasmid vector, characterized in 

that it includes a DNA-sequence according to any one of 
Claims 2 and 6-8, preferably the vector pHDLG or pHDL. 

10. A host cell, characterized in that it 
15 is transformed with the hybrid plasmid according to 

Claim 9, in particular a host which belongs to the 
species E. coli . particularly E. coli LE392, or Bacilli^s 
subtilis. Saccaromvces cerevisiae , preferably Id. Ref . 
DSSM E. coli LE392 pHDL and E. coli LE392/pHDLG res- 
20 pectively. 

11. A method for producing a protein accbrding to 
Claims 1 and 3-5, characterized by cultiv^ 
ating a host cell according to Claim 10 under suitable 

25 conditions; accumulating the protein in the culture or 
lysing the cells and extracting the protein therefrom. 

12. A reagent apparatus for binding, separating and 
identifying immunoglobulins, characterized 

30 in that it includes a protein according to any one of 
Claims 1 and 3-5. 

13. A composition, characterized in that 
it includes a protein according to any one of Claims 1 

35 and 3-5, and optionally additives or carriers. 
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14. A pharmaceutical composition, charac- 
terized in that it includes a protein according 
to any one of Claims 1 and 3-5 , and optionally a pharma- 
ceutically acceptable carrier or extender. 
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■ AAGCT TAAGGAGGTTA ATCG ATG AAAAAA ACT_GCT ATC GCT ATC GCT GTT 
HA C mer - 

i f RBS 1 
n 1 a 
3 2 1 

GCT CTG GCT GGT TTC GCT ACT GTT GCT CAG GCG GCG CCG AGA TCT 

3T?\M B 

g 
l 



a 
r 
1 



AAA CAG GAA TTC GAG CTC GGT ACC CGG GGA TCC TCT AGA GTC GAC 



E S 

c a 

0 c 

1 1 



CTG CAG GCA TGC 
P S 



K 
P 
h 
1 



X 
m 
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B 
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x 
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S 
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MULTIPLE CLONING SEQUEN-i 
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TRANSCRIPTIONS 
TERMINATING 
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822^ SIGNAL PEPTIDE FOR THE 
SEQUENCE FROM OmPA 
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RIBOSOMAL AGGAGG 
BINDING SEQUENCE 

"RIGHT" PROMOTOR FROM COLIPHAGEA 



CI857 THE GENE FOR A HEAT-SENEITIVE REPRES- 
SOR-PROTEIN FROM COLIPHAGE A 
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F G 1 PLASMA pHD 389. THE RIBOSOMAL 
' BINDING-SEQUENCE (EMPHASIZED WITH 

A FULL LINE), THE SEQUENCE FOR SIGNAL PEPTIDE FROM ompA 
(FROM E.COll) (DOTTED LINE) AND RECOGNITION SEQUENCE FOR 
SEVERAL RESTRICTION ENZYMES ARE SHOWN. 
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FIG. 3 SCHEMATIC OVERALL VEIW OF THE PRODUCTION 
OF PROTEIN L 



CI857 




SEE FIG. 1 
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. \ Narl/BamHI 



\ 



JTGG 
^Hpall 




B5(21bp) 
Jl 



TAATAAGGATCC 
BamHI 



SUBSTITUTE SHEET 



PCT/SE93/0037S 



7/18 




TRANSFORM • 

T0 Leon 
strain \iwr FIG 3(2) 



SUBSTITUTE SHEET 



WO 93/22342 



PCT/SE93/00375 



'-S, 



8/18 

FIG.4 SCHEMATIC OVERALL VIEW OF PRODUCTION 
. OF PROTEIN LG 
PROTEIN L-GENE . 
(IN STRAIN Pm 312) 
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FIG. 5b SCHEMATIC OVERALL VIEW OF THE PRODUCTION 
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SCHEMATIC OVERALL VIEW OF THE PRODUCTION OF PROTEIN LH 
PROTEIN L-GENE 
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H OJ I— id |- (D <X >. H tt> <I-h <t^ <X >. rv. 

. . <r « . .<ir. hj . >-_j <r_i h«j uu cdcd ; ... <r_j 1 . 

<I ql <E«5 U fl) UCL <Xtn H U H L <X C <X 3 ID 

id 'i cd<i o<e o<r <t_i cd<i <icn UCD CDCD 

U C H C <I c hL CD j UL CD HJ <[j hi. 

Ul <x m <x-« cr >* <£-< U-C <E-< uu 

Uu. <E<T UCD hi- CD ID <XH CD<I CDCD HCf) 

C CD 3 ' <Illl Ul <X 3 H *S Cc <X 0> <El 

><X U) ><E >. ><I in ><!-. >U-« ><X-« >CD V. >U Q) 

f I<I CDCD <EJ tD<X CD tD CD<I UCD <I<t h(i) 
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